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Primordial Germ Cell Genes 

The present invention relates to genes which are expressed exclusively in the earliest 
populations of primordial germ cells (PGCs) and the use of said genes and the products 
5 thereof in identification of PGCs in cell populations. 



Introduction 



Post fertilisation, the early mammalian embryo undergoes four rounds of cleavage to 
10 form a morula of 16 cells. These cells, following further rounds of division, develop into 
a blastocyst in which the cells can be divided into two distinct regions; the inner cell 
mass, which will form the embryo, and the trophectodcrm, which will form extra- 
embryonic tissue, such as the placenta. 

15 The cells that form part of the embryo up until the formation of the blastocyst are 
totipotent; in other words, each of the cells has the ability to give rise to a complete 
individual embryo, and to all the extra-embryonic tissues required for its development. 
After blastocyst formation, the cells of the inner cell mass are no longer totipotent, but are 
pluripotent, in that they can give rise to a range of different tissues. A known marker for 

20 such cells is the expression of the enzyme alkaline phosphatase. 

Primordial germ cells (PGCs) are pluripotent cells that have the ability to differentiate 
into all three primary genu layers. In mammals, the PGCs migrate from the base of the 
allantois, through the hindgut epithelium and dorsal mesentery, to colonise the gonadal 

25 anlague. The PGC-derived cells have a characteristically low cytoplasm/nucleus ratio, 
usually with prominent nucleoli. PGCs may be isolated from the embryos by removing 
the genital ridge of the embryo, dissociating the PGCs from the gonadal anlague, and 
collecting the PGCs. The earliest PGC population is reported to consist of a cluster of 
some 40 alkaline phosphatase positive cells, found at the base of the emerging allantois. 

30 7.25 days post-fertilisation (Ginsburg etaL, (1990) Development 110:521-528). 

PGCs have many applications in modern biotechnology and molecular biology. They arc 
useful in the production of transgenic animals, where embryonic germ (EG) cells derived 
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domain. It is a member of a multigene family, and a plurality of members of the family 
have been isolated from PGCs. The GCRI polypeptide shows 89% homology to the 
interferon-inducible protein (sp: INIB RAT, pir:JC1241; GenPept Girl 11 876). It is 
considered that the latter polypeptide is the rat homologue of GCRI. GCR2 is a 150 
5 amino acid polypeptide, of approximately 18kd, with a very basic pi of 9.76. The 
polypeptide comprises a nuclear localisation signal, and is a nuclear protein. It has no 
homology to any known protein. 

The invention further provides polynucleotides which comprise a contiguous stretch of 
10 nucleotides from SRQ. ID. No. I or SEQ. ID. No. 2, or of a sequence at least 90% 
homologous thereto. Advantageously, this stretch of contiguous nucleotides is 50 
nucleotides in length, preferably 40, 35, 30, 25, 20. 15 or 10 nucleotides in length. 

The genes GCRI and GCR2 encode novel polypeptides, the sequences of which are set 
15 forth in SEQ. ID. No. 2 and SEQ. ID. No. 4. The invention therefore encompasses 
polypeptides encoded by the nucleic acids according to the invention. Preferably, the 
polypeptides have the sequences set forth in SEQ. ID. No, 2 and SEQ. ID. No. 4. 

Antibodies may be raised against the polypeptides according to the invention. In 
20 particular, antibodies may be raised against the extracellular domain of GCRI, which is a 
transmembrane polypeptide. 

Antibodies and nucleic acids according to the invention are useful for the identification of 
PGCs in cell populations. The invention therefore provides a means to isolate PGCs, 
25 useful for example for the study of germ tissue development and the generation of 
transgenic animals, and PGCs when isolated by a method according to the invention. 

Homologies of GCRI and GCR2, such as rat interferon-inducible protein, may also be 
used to identify PGCs according lo the present invention. 

30 

Moreover, the invention provides a method by which genes specifically expressed in 
PGCs may be isolated, comprising the steps of: 

a) providing a population of cells containing PGCs; 
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Although homology can also be considered in terms of similarity (i.e. amino acid residues 
having similar chemical properties/functions)., in the context of the present invention ii is 
preferred to express homology in terms of sequence identity. 

5 Homology comparisons can be conducted by eye, or more usually, with the aid of readily 
available sequence comparison programs. These commercially available computer 
programs can calculate % homology between two or more sequences. 



°/o homology may be calculated over contiguous sequences, i.e. one sequence is aligned with 
10 the other sequence and each amino acid in one sequence directly compared with the 
corresponding amino acid in the other sequence, one residue at a lime. This is called an 
"ungappecf* alignment. Typically, such ungapped alignments are performed only over a 
relatively short number of residues (for example less than 50 contiguous amino acids). 

15 Although this is a very simple and consistent method, it (ails to take into consideration that, 
for example, in an otherwise identical pair of sequences, one insertion or deletion will cause 
the following amino acid residues to be put out of alignment, thus potentially resulting in a 
large reduction in % homology when a global alignment is performed. Consequently, most 
sequence comparison methods arc designed to produce optimal alignments that take into 

20 consideration possible insertions and deletions without penalising unduly the overall 
homology score. This is achieved by inserting "'gaps*' in the sequence alignment to try to 
maximise local homology. 

1 lowcver. these more complex methods assign "gap penalties" to each gap thai occurs in the 
25 alignment so that, for the same number of identical amino acids, a sequence alignment with 
as few gaps as possible - reflecting higher relatedncss between the two compared sequences 
- will achieve a higher score than one with many gaps. "Afflne gap costs" arc typically used 
that charge a relatively high cost for the existence of a gap and a smaller penalty for each 
subsequent residue in the gap. This is the most commonly used gap scoring system. High 
30 gap penalties will of course produce optimised alignments with fewer gaps. Most alignment 
programs allow the gap penalties to be modified. However it is preferred to use the default 
values when using such software for sequence comparisons. For example when using the 
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Jn any case, however, the key feature of the sequences of the invention - namely lhat they 
are specific for PGCs and can serve as a marker for PGCs in a cell population - is retained. 



Natural variants of GCR1 and GCR2 are likely to comprise conservative amino acid 
5 substitutions. Conservative substitutions may be defined, for example according to the 
Tabic below. Amino acids in the same block in the second column and preferably in the 
same line in the third column may be substituted for each other: 



ALIPHATIC 


Non-polar 


GAP 






I L V 




Polar - uncharged 


C S T M 






N Q 




Polar - charged 


D E 






K K 


AROMATIC 




H F W Y 



10 

Polypeptides of the invention useful as markers also include fragments of the above 
mentioned full length polypeptides and variants thereof, including fragments of the 
sequences set out in SEQ. ID. No. 2 and SEQ. ID. No. 4. Preferred fragments include those 
which include an epitope. Suitable fragments will be at least about 5, e.g. 10. 12, 15 or 20 

15 amino acids in length. They may also be less than 100. 75 or 50 amino acids in length. 
Polypeptide fragments of the GCR proteins and allelic and species variants thereof may 
contain one or more (e.g. 5, 10, 15, or 20) substitutions, deletions or insertions, including 
conserved substitutions. Where substitutions, deletion and/or insertions occur, for example in 
different, species, preferably less than 50%, 40% or 20% of the amino acid residues depicted 

20 in the sequence listings are altered. 
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nucleotide and -9 for each mismatch. The default gap creation penalty is -50 and the default 
gap extension penalty is -3 for each nucleotide. 

The present invention also encompasses nucleotide sequences that are capable of hybridising 
5 selectively to the sequences presented herein, or any variant, fragment or derivative thereof, 
or to the complement of any of the above. Nucleotide sequences are preferably at least 15 
nucleotides in length, more preferably at least 20, 30, 40 or 50 nucleotides in length. 



The term ''hybridisation" as used herein shall include "the process by which a strand of 
10 nucleic acid joins with a complementary strand through base pairing" as well as the 
process of amplification as carried out in polymerase chain reaction technologies. 

Polynucleotides of the invention capable of selectively hybridising to the nucleotide 
sequences presented herein, or to their complement, will be generally at least 70%, 
15 preferably at least 80 or 90% and more preferably at least 95% or 98% homologous to the 
corresponding nucleotide sequences presented herein over a region of at least 20. preferably 
at least 25 or 30, for instance at least 40, 60 or 100 or more contiguous nucleotides. 

The term "selectively hybridisable" means that the polynucleotide used as a probe is used 
20 under conditions where a target polynucleotide of the invention is found to hybridize to the 
probe at a level significantly above background. The background hybridization may occur 
because of other polynucleotides present, for example, in the cDNA or genomic DNA 
library being screening. In this event, background implies a level of signal generated by 
interaction between the probe and a non-specific DNA member of the library which is less 
25 than 10 fold, preferably less than 100 fold as intense as the specific interaction observed with 
the target DNA, The intensity of interaction may be measured, for example, by 
radiolabelling the probe, e.g. with 32 P. 

Hybridisation conditions are based on the melting temperature (Tin) of the nucleic acid 
30 binding complex, as taught in Berger and Kimmel (1987, Guide to Molecular Cloning 
Techniques, Methods in Enzymology, Vol 152, Academic Press, San Diego CA), and 
confer a defined "stringency" as explained below. 
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10 



by conventional means using radioactive or non-radioactive labels, or the polynucleotides 
may be cloned into vectors. Such primers, probes and other fragments will be at least 15. 
preferably at least 20, for example at least 25, 30 or 40 nucleotides in length, and are also 
encompassed by the term polynucleotides of the invention as used herein. Preferred 
fragments are less than 500, 200, 100, 50 or 20 nucleotides in length. 

Polynucleotides such as a DNA polynucleotides and probes according to the invention may 
be produced recombinant^, synthetically, or by any means available to those of skill in the 
art. They may also be cloned by standard techniques. 

In general, primers will be produced by synthetic means, involving a step wise manufacture 
of the desired nucleic acid sequence one nucleotide at a time. Techniques for accomplishing 
this using automated techniques are readily available in the art. 



15 Longer polynucleotides will generally be produced using recombinant means, tor example 
using PGR (polymerase chain reaction) cloning techniques. This will involve making a pair 
of primers (e.g. of about 15 to 30 nucleotides) flanking a region of the sequence which it is 
desired to clone, bringing the primers into contact with mRNA or cDNA obtained from an 
animal or human cell, performing a polymerase chain reaction under conditions which bring 

20 about amplification of the desired region, isolating the amplified fragment (e.g. by purifying 
the reaction mixture on an agarose gel) and recovering the amplified DNA. The primers may 
be designed to contain suitable restriction enzyme recognition sites so that the amplified 
DNA can be cloned into a suitable cloning vector 



25 D. NUCLEOTIDE VECTORS 



Polynucleotides of the invention can be incorporated into a recombinant rcplicable vector. 
The vector may be used to replicate the nucleic acid in a compatible host cell. Thus in a 
further embodiment, the invention provides a method of making polynucleotides of the 
30 invention by introducing a polynucleotide of the invention into a replicable vector, 
introducing the vector into a compatible host cell, and growing the host cell under 
conditions which bring about replication of the vector. The vector may be recovered from 
the host cell. Suitable host cells include bacteria such as E. colL yeast, mammalian cell 
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The promoter is typically selected from promoters which are functional in mammalian 
cells, although prokaryotic promoters and promoters functional in other eukaryotic cells 
may be used. The promoter is typically derived from promoter sequences of viral or 
5 eukaryotic genes. For example, it may be a promoter derived from the genome of a cell in 
which expression is to occur. With respect to eukaryotic promoters, ihey may be 
promoters that function in a ubiquitous manner (such as promoters of a-actin, P-actin, 
tubulin) on alternatively, a tissue-specific manner (such as promoters of the genes for 
pyruvate kinase). They may also be promoters that respond to specific stimuli, for 
10 example promoters that bind steroid hormone receptors. Viral promoters may also be 
used, for example the Moloney murine leukaemia virus long terminal repeat (MMLV 
I TR) promoter, the Rous sarcoma virus (RSV) LTR promoter or the human 
cytomegalovirus (CMV) IE promoter. 

15 It may also be advantageous for the promoters to be inducible so that the levels of 
expression of the heterologous gene can be regulated during the life-time of the cell. 
Inducible means that the levels of expression obtained using the promoter can be 
regulated. 

20 In addition, any of these promoters may be modified by the addition of further regulatory 
sequences, for example enhancer sequences. Chimeric promoters may also be used 
comprising sequence elements from two or more different promoters described above. 

E, HOST CELLS 

25 

Vectors and polynucleotides of the invention may be introduced into host cells for the 
purpose of replicating the vectors/polynucleotides and/or expressing the proteins of the 
invention encoded by the polynucleotides of the invention. Although the proteins of the 
invention may be produced using prokaryotic cells as host cells, it is preferred to use 
30 eukaryotic cells, for example yeast, insect or mammalian cells, in particular mammalian 
cells. 

Vectors/polynucleotides of the invention may introduced into suitable host cells using a 
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the antibody in vivo or in vitro, Such labels may be radioactive labels or radioopaquc 
labels, such as metal particles, which are readily visualisable within an embryo or a cell 
mass. Moreover, they may be fluorescent labels or other labels which are visual isable on 
tissue samples, 

5 

Recombinant DNA technology may be used to improve the antibodies of the invention. 
Thus, chimeric antibodies may be constructed in order to decrease the imrnunogenicity 
thereof in diagnostic or therapeutic applications. Moreover, imrnunogenicity may be 
minimised by humanising the antibodies by CDR grafting [see European Patent 
1 0 Application 0 239 400 (Winter)] and, optionally, framework modification [EP 0 239 400]. 

Antibodies according to the invention may be obtained from animal serum, or. in the case 
of monoclonal antibodies or fragments thereof, produced in ceil culture. Recombinant 
DNA technology may be used to produce the antibodies according to established 
15 procedure, in bacterial or preferably mammalian cell culture. The selected cell culture 
system preferably secretes the antibody product. 

Therefore, the present invention includes a process for the production of an antibody- 
according to the invention comprising cukuring a host, e.g. E. coli or a mammalian cell. 
20 which has been transformed with a hybrid vector comprising an expression cassette 
comprising a promoter operably linked to a first DNA sequence encoding a signal peptide 
linked in the proper reading frame to a second DNA sequence encoding said antibody 
protein, and isolating said protein. 

25 Multiplication of hybridoma cells or mammalian host cells in vitro is carried out in 
suitable culture media, which are the customary standard culture media, for example 
Dulbecco's Modified Eagle Medium (DMEM) or RPMI 1640 medium, optionally 
replenished by a mammalian serum, e.g. foetal calf scrum, or trace elements and growth 
sustaining supplements, e.g. feeder cells such as normal mouse peritoneal exudate cells. 

30 spleen cells, bone marrow macrophages, 2-aminoethanol, insulin, transferrin, low density 
lipoprotein, oleic acid, or the like. Multiplication of host cells which are bacterial cells or 
yeast ceils is likewise carried out in suitable culture media known in the art, for example 
for bacteria in medium LB, NZCYM, NZYM, NZM, Terrific Broth, SOB. SOC. 2 x YT. 
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For isolation of the antibodies, the immunoglobulins in the culture supcrnatants or in the 
ascitic fluid may be concentrated, e.g. by precipitation with ammonium sulphate, dialysis 
against hygroscopic material such as polyethylene glycol, filtration through selective 
membranes, or the like. If necessary and/or desired, the antibodies arc purified by the 
5 customary chromatography methods, for example gel filtration, ion-exchange 
chromatography, chromatography over DEAE-cellulose and/or (immuno-) affinity 
chromatography, e.g. affinity chromatography with GCR1 or GCR2, or fragments 
thereof, or with Proiein-A. 

10 The invention further concerns hybridoma cells secreting the monoclonal antibodies of 
the invention. The preferred hybridoma cells of the invention are genetically stable, 
secrete monoclonal antibodies of the invention of the desired specificity and can be 
activated from deep-frozen cultures by thawing and rccloning. 

15 The invention also concerns a process for the preparation of a hybridoma cell line 
secreting monoclonal antibodies directed to GCR1 and/or GCR2, characterised in that a 
suitable mammal, for example a Balb/c mouse, is immunised with a one or more GCR1 
or GCR2 polypeptides, or antigenic fragments thereof; antibody-producing cells of the 
immunised mammal are fused with cells of a suitable myeloma cell line, the hybrid cells 

20 obtained in the fusion are cloned, and cell clones secreting the desired antibodies are 
selected. For example spleen cells of Balb/c mice immunised with GCR1 and/or GCR2 
are fused with cells of the myeloma cell line PAI or the myeloma cell line Sp2/0-Agl4. 
the obtained hybrid cells are screened for secretion of the desired antibodies, and positive 
hybridoma cells are cloned. 

25 

Preferred is a process for the preparation of a hybridoma cell line, characterised in that 
Balb/c mice are immunised by injecting subcutaneously and/or intraperitoneal^ between 
10 and 10 7 and 10 s cells expressing GCR1 and/or GCR2 and a suitable adjuvant several 
times, e.g. four to six times, over several months, e.g. between two and four months, and 
30 spleen cells from the immunised mice are taken two to four days after the last injection 
and fused with cells of the myeloma cell line PAI in the presence of a fusion promoter, 
preferably polyethylene glycol Preferably the myeloma cells arc fused with a three- to 
twentyfold excess of spleen cells from the immunised mice in a solution containing about 
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For the assembly of complete tetrameric immunoglobulin molecules and the expression 
of chimeric antibodies, the recombinant DNA inserts coding for heavy and light chain 
variable domains are fused with the corresponding DNAs coding for heavy and light 
5 chain constant domains, then transferred into appropriate hosL ceils, for example after 
incorporation into hybrid vectors. 



The invention therefore also concerns recombinant DNAs comprising an insert coding for 
a heavy chain murine variable domain of an antibody directed to GCR1 and/or GCR2 
10 fused to a human constant domain g, for example yl, y2, y3 or y4, preferably yl or y4. 
Likewise the invention concerns recombinant DNAs comprising an insert coding for a 
light chain murine variable domain of an antibody directed to GCR1 and/or GCR2 fused 
to a human constant domain k or X, preferably k. 



15 In another embodiment the invention pertains to recombinant DNAs coding for a 
recombinant polypeptide wherein the heavy chain variable domain and the light chain 
variable domain are linked by way of a spacer group, optionally comprising a signal 
sequence facilitating the processing of the antibody in the host cell and/or a DNA coding 
for a peptide facilitating the purification of the antibody and/or a cleavage site and/or a 

20 peptide spacer and/or an effector molecule. 

The DNA coding for an effector molecule is intended to be a DNA coding for the effector 
molecules useful in diagnostic or therapeutic applications. Thus, effector molecules 
which are toxins or enzymes, especially enzymes capable of catalysing the activation of 
25 prodrugs, are particularly indicated, The DNA encoding such an effector molecule has the 
sequence of a naturally occurring enzyme or toxin encoding DNA, or a mutant thereof, 
and can be prepared by methods well known in the art. 



H. DETECTION OF PGCs IN CELL POPULATIONS 

30 

Polynucleotide probes or antibodies according to the invention may be used for the 
detection of PGCs in cell populations. As used herein, a "cell population" is any 
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and inserted into a vector as described above, using conventional PCR cloning and 
ligation techniques. 

Antibodies may be labelled with any label capable of generating a signal. The signal may 
5 be any detectable signal, such as the induction of the expression of a detectable gene 
product. Examples of detectable gene products include bioluminescent polypeptides, 
such as luciferase and GFP, polypeptides detectable by specific assays, such as (5- 
galactosidase and CAT, and polypeptides which modulate the growth characteristics of 
the host cell, such as enzymes required for metabolism such as H1S3, or antibiotic 
10 resistance genes such as G418. In a preferred aspect of the invention, the signal is 
detectable at the cell surface. For example, the signal may be a luminescent or 
fluorescent signal, which is detectable from outside the cell and allows cell sorting by 
FACS or other optical sorting techniques. 

15 Preferred is the use of optical immunosensor technology, based on optical detection of 
fluorescently-labelled antibodies. Immuno sensors arc biochemical detectors comprising 
an antigen or antibody species coupled to a signal transducer which detects the binding of 
the complementary species (Rabbany et aL, 1994 Crit Rev Biomed Eng 22:307-346; 
Morgan et aL, 1996 Clin Chem 42:193-209). Examples of such complementary species 

20 include the antigen Zif 268 and the anti-Zif 268 antibody. Immunosensors produce a 
quantitative measure of the amount of antibody, antigen or hapten present in a complex 
sample such as serum or whole blood (Robinson 1991 Biosens Bioeleclro)i 6:183-191). 
The sensitivity of immunosensors makes them ideal for situations requiring speed and 
accuracy (Rabbany et aL, 1994 Crit Rev Biomed Eng 22:307-346). 

25 

Detection techniques employed by immunosensors include electrochemical, piezoelectric 
or optical detection of the immunointeraction (Ghindilis et aL, 1998 Biosens Bioelectron 
1:1 13-131). An indirect immunosensor uses a separate labelled species that is delected 
after binding by, for example, fluorescence or luminescence (Morgan et al. y 1996 Clin 
30 Chem 42:193-209). Direct immunosensors detect the binding by a change in potential 
difference, current, resistance, mass, heat or optical properties (Morgan et aL, 1996 Clin 
Chem 42:193-209). Indirect immunosensors may encounter fewer problems due to non- 
specific binding (Attridge et aL, 1991 Biosens Bioelecton 6:201-214; Morgan el aL, 1996 
Clin Chem 42:193-209). 



Received 



18-01-01 18:42 



From-+44 0 23 80719800 



To-THE PATENT OFFICE Page 25 



FROM: D VOUNG I CO FAX NO.: + 44 0 23 80719800 18-01-01 18:48 P. 27 

P10490GB 23 

i 

and embryonic tissues just posterior to and above the most proximal part of the primitive 
streak. The cluster persists at this position at least until Early/Mid Bud stage. In the 
inbred 129Sv strain, the PGC cluster is found to contain a slightly larger number of the 
cells, which are more tightly packaged than in the C57BL/6 strain. The 129Sv strain is 
5 used for subsequent experiments, as a better recovery of the earliest PGCs is obtained. 



129Sv embryos are isolated at E7.5 in DMEM plus 10% PCS buffered with 25mM 
HEPES at room temperature and the developmental stage of each embryo is determined 
under a dissection microscope. The precise developmental stage can differ substantially 

10 even amongst embryos within the same litter. Embryos that are at the no bud or early bud 
(allantoic) stage are chosen for further dissection, which in part is dictated by the case of 
identification of the region containing PGCs as seen under the dissection microscope. 
The fragment that is expected to contain the PGC cluster is cut out very precisely by 
means of solid glass needles. This region is dissociated it into single cells using 0.25% 

15 trypsin- ImM EGT A/PBS treatment at 37°C for 10 min, followed by gentle pipetting with 
a mouth pipette. The dissected fragment usually contained between 250-300 cells. The 
procedure for cell dispersal with this gentle procedure left the visceral endoderm layer 
remained as an intact cellular sheet. 

20 We picked single cells randomly from the cell suspension by a mouth pipette and put 
individual. single cells (but avoiding generating air bubbles), into a thin-walled PGR tube 
containing 4^1 of ice-cold cell lysis buffer (50mM Tris-HCI pH8.3 ? 75mM KCL 3mM 
MgCI 2s 0.5% NP-40, containing 80ng/ml pd(T)24 ? 5|ig/ml prime RNase inhibitor. 
324U/mi RNA guard, and lOmM each of dATP, dCTP, dGTP, and dTTP). The volume of 

25 medium carried with the single cell is less than 0.5|al. The tube is briefly centrifuged to 
ensure that the cell is indeed in the lysis buffer. During each separate experiment, we 
picked a total of 19 single cells, and left one tube without a cell, to serve as a negative 
control for the PGR amplification procedure. All the cells that are collected in tubes are 
kept on ice before starting the subsequent procedure. 

30 

The cells are lysed by incubating the tubes at 65°C for lmin, and then kept at room 
temperature for 1-2 min to allow the oligo dT to anneal the to RNA. First-strand cDNA 
synthesis is initiated by adding SOU of Moloney murine leukaemia virus (MMLV) and 
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Example 2 



Identification of PGCs by examination of the expression of marker genes 

5 The embryonic fragment which is excised theoretically contains three major components: 
the allantoic mesoderm, PGCs, and extraembryonic mesoderm surrounding PGCs. In 
order to identify the single cell cDNA of PGC origin amongst these samples, positive and 
negative selection of the constructed cDNAs is performed, by examining the expression 
of four marker genes (BMP4, TNAP, Hoxbl, and Oct4), which are known to be either 
10 expressed or repressed in various cell types in this region. 

At the No/Early Bud stage, BMP4 is reported to be expressed in the emerging allantoic 
and mesodermal components of the developing amnion, chorion, and visceral yolk sac 
(5). The boundary of BMP4 expression is very sharp, and the expression is completely 

15 excluded in the mesodermal region beneath the epithelial lining continuous from the 
amnionic mesoderm where the putative PGCs are determined. Therefore, BMP4 is used 
as a negative marker for the selection. Primer pairs are designed for amplifying the C 
terminal portion of BMP4 (5*: GCC ATA CCT TGA CCC GCA GAA G, 3*: AAA TGG 
CAC TCA GTT CAG TGG G). The PCR amplification is performed using 0.5^1 of the 

20 cDNA solution as a template according to the following schedule: 95°C for 1 min, 55°C 
for 1 min, and 72°C for 1 min for 20 cycles. Among 83 samples tested, 57 samples show 
the expected size of bands, indicating expression of BMP4 these single cells. These 
samples are considered to be of allantoic mesodermal origin, and therefore excluded from 
amongst the candidates representing cells of PGC origin. 

25 

The expression of tissue non-specific alkaline phosphatase (TNAP), which has long been 
used as an early marker for PGCs (3), is then examined. Primer pairs are designed (5': 
CCC AAA GCA CCT TAT TTT TCT ACC, 3': TTG GCG AG f CTC TGC A AT TGG) 
and the same PCR reaction as above is performed. Amongst the 26 samples, 22 samples 
30 are judged to be positive for TNAP. From the alkaline phosphatase staining of the 
sectioned embryos, it is known that the somatic cells surrounding PGCs also express 
some amount of TNAP, although the level of expression is slightly lower than that in 



Received 18-01-01 18:42 



From-+44 0 23 80719800 



To-THE PATENT OFFICE Page 29 



FROM : D VOUNG & CO 

PI 0490GB 



FAX NO.: + 44 0 23 86719800 

27 



18-01-01 13:50 P. 31 



Example 3 

Differential screening of single cell cDNA libraries 

5 As the efficiency of the amplification of cDNA differs in each tube, it is very important to 
select the samples with the most efficiently amplified cDNA for the construction of 
libraries. The amplification of six different genes (ribosomal protein S12, intermediate 
filament protein vimenlin, P tubulin-5, a actin, Oct4. E-cadherin) is examined in the 10 
PGC candidate samples, by Southern blot analysis. Judging from the overall profile of the 
10 amplification of all these six genes, three cDNA preparations are selected for the 
construction of libraries. 

To obtain the maximum amount of double strand cDNA, an extension step is performed 
with 5\i\ of cell cDNA in lOOjil of the PGR buffer described as above (including lj.il of 

15 Amplitaq) accordiag to the following schedule: 94°C for 5min. 42°C for Smin. 72°C for 
30min. The solution is extracted by phenol/chloroform treatment, and the amplified 
cDNAs are precipitated by ethanol, suspended in TE, and completely digested with 
EcoRI. The PGR primer and excess amount of dNTPs are removed by QIAGEN PGR 
Purification Kit, and all the purified cDNAs arc run on a 2% low melting agarose gel. 

20 cDNAs above 500bp are cut and purified by QIAGEN Gel Purification Kit. The purified 
cDNAs arc precipitated by ethanol and suspended in TE and ligated into X LAP II vector 
arms. The ligated vector is packaged, titercd and the ratio of the successfully ligated 
clones is monitored by amplifying the inserts with T3 and T7 primers from 20 plaques. 
More than 95% of the phage are found to contain inserts. 

25 

The representation of the three genes, ribosomal protein S12, (3 tubulin-5. Oct4, is 
quantitated by screening 5000 plaques, and the library of the best quality among the three 
(SI 2 0.62%, [3 tubulin 0.4%, Oct4 0.5%) is used for the differential screening. As a 
comparison partner with the PGG probe, one of the most efficiently amplified 
30 surrounding somatic cell cDNA (Oct4 (+), TNAP(+/-), BMP(-), and IIoxbI(+)) is 
selected by the similar Southern blot analysis. 
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The first gene, GCR1 (Germ cell restricted- 1), encodes a 137 amino acid protein. The 
best fit model of the EMBL program PrcdictProtein predicts two transmembrane 
domains, both N and C terminus ends being located outside. Database searches reveal a 
sequence match with the rat interferon-inducible protein (sp:INIfi RAT, pir:JC1241) with 
5 unknown function. The GCR1 sequence appears six times in our screen, indicating high 
level expression in PGCs. 

The second gene, GCR2, encodes a 150 amino acid protein with very basic pi (pl=9.67) 
and containing a nuclear localisation signal. This protein has no sequence match with any 
10 known proteins. EST database searches reveal that this sequence is only found in the 
preimplantation embryo and germ line (newborn ovary and female 12.5 mesonephros and 
gonad etc) ESTs, indicating the specificity of the expression in the totipotent or 
pluripotent cells. 

15 Example 4 

Identification of PGCs by screening for GCR1 and GCR2 expression 

Although PGCs are identified in Example 2 by analysis of BMP4, TNAP, Hoxbl. and 
20 Oct4, no single one of these genes can be taken as a marker for the PGC state. However, 
both GCR1 and GCR2 may be used as such in the present invention. 

The boundary of GCR2 expression in particular is well-defined, and the expression is 
substantially limited to PGCs. Therefore, GCR2 is used as a positive marker for the 

25 selection of PGCs. Primer pairs are designed for amplifying the C terminal portion of 
GCR2 (5': GCCATTCAGATGTCTCTGCAC, 3': CTCACAGCTTGAGGCTTCTAA). 
The PCR amplification is performed using 0-5jj.1 of the cDNA solution obtained from 
PGCs in Example 1 as a template according to the following schedule: 95°C for 1 min, 
55°C for 1 min, and 72°C for l min for 20 cycles. Among 83 samples tested, only those 

30 taken from PGCs show expression of GCR2. Hence, GCR2 is a positive marker for the 



PGC fate. 
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All publications mentioned in the above specification are herein incorporated by 
25 reference. Various modifications and variations of the described methods and system of 
the invention will be apparent to those skilled in the art without departing from the scope 
and spirit of the invention. Although the invention has been described in connection with 
specific preferred embodiments, it should be understood that the invention as claimed 
should not be unduly limited to such specific embodiments. Indeed, various modifications 
30 of the described modes for carrying out the invention which are obvious to those skilled 
in molecular biology or related fields are intended to be within the scope of the following 
claims. 
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11. A method according to claim 8, wherein the expression of the nucleic acid 
sequence is determined by detecting the protein product encoded thereby. 

12. A method according to claim II, wherein the protein product is detected by 
5 immunostaining. 



13. An antibody specific for a polypeptide according to claim 7. 

14. An antibody according to claim 13, specific for the extracellular domain of OCR1 . 

10 

15. Use of an antibody according to claim 13 or claim 14 for the identification of a 
PGC in a population of cells. 

16. A PGC when identified by a method according to any one of claims 8 to 12. 

15 

17. A method for isolating a gene specifically expressed in PGCs, comprising the 
steps of: 

a) providing a population of cells containing PGCs; 

b) isolating one or more PGCs therefrom and providing single-cell PGC isolates; 
20 c) amplifying the transcribed nucleic acid present in a single PGC; 

d) conducting a subtractive hybridisation screen to identify transcripts present in 
PGCs but not in somatic cells; and 

e) probing a nucleic acid library with one or more transcripts identified in d) to 
clone one or more genes which are specifically expressed in PGCs. 
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SEQUENCE LISTING 



SEQ. ID. No. 1 

5 GCR1 full length 

GCCGCAGAAAGGGCAGACCCGCAGCGCGCTCCATCCTTrGCCC TCCAGTGCT 
GCCITTGCTCCGCACCATGAACCACACTTCTCAAGCCTTCATCACCGC1GCCA 
GTGGAGGACAGCCCCCAAACTACGAAAGAATCAAGGAAGAATATGAGGTGG 

10 CTGAGATGGGGGCACCGCACGGATCGGCTTCTGTCAGAAC 1 ACTGTGATCAA 
CATGCCCAGAGAGGTGTCGGTGCCTGACCATGTGGTCTGGTCCCTGTTCAATA 
CACTCTTCATGAACTTCTGCTGCCTGGGCTTCATAGCCrATGCCTACrCCGTGA 
AGTCTAGGGATCGGAAGATGGTGGGTGATGTGACTGGAGCCCAGGCCTACGC 
CTCCACTGCTAAGTGCCTGAACATCAGCACCTTGGTCCTCAGCATCCTGATGG 

15 T'rGTTA'l'CACCA'lTGTTAGTGTCATCATCATTGTTCTTAACGCTCAAAACCTTC 
ACACTTAATAGAGGATTCCGACTTCCGGTCCTGAAGTGC'JTCACCCTCCGCAG 
CTGCGTCCCTCCTTGCCCCTCCCTACACGCAGGTGTAACACTCATTTATCTATC 
CACAGTGGATTCAATAAAGTGCACTTGATAACCACC 

20 SKQ. ID. No. 2 

GCR2 full length 

GGAICACAGACTGACTGCTAATTGGGTCTTGGTTTTAGGTCTT1TCAAAGACT 
25 AAGCAATCTTG1TCCGAGCTAGCTTTTGAGGCTTCTGCCCATCGCATCGCCAT 
GGAGGAACCATCAGAGAAAG TCGACCCAATGAAGGACCCTGAAACTCCTCAG 
AAGAAAGATGAAGAGGACGCTTTGGATGATACAGACGTCCTACAACCAGAA 
ACACTAGTAAAGGTCATGAAAAAGCTAACCCTAAACCCCGG'rGTCAAGCGGT 
CCGCACGCCGGCGCAGTCTACGGAACCGCATTGCAGCCGTACCTGTGGAGAA 
30 CAAGAGTGAAAAAATCCGGAGGGAAGTTCAAAGCGCCTTTCCCAAGAGAAG 
GGTCCGCACTT'rG'ITGTCGGTGCTGAAAGACCCTA'I AGCAAAGATGAGAAGA 
CTTGTTCGGATTGAGCAGAGACAAAAAAGGCTGGAAGGAAATGAGTTTGAAC 
GGGACAGTGAGCCATTCAGATGTCTCTGCACTTTCTGCCATTATCAAAGATGG 
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